Acetyl transferases and their use for producing carotenoids

ABSTRACT

The present invention relates to novel acetyl transferases, nucleic acid sequences coding therefore, expression constructs and vectors comprising these sequences, microorganisms transformed therewith and use thereof.

BACKGROUND OF THE INVENTION

The present invention relates to novel acetyl transferases, nucleic acid sequences coding therefore, expression constructs and vectors comprising these sequences, microorganisms transformed therewith and processes for the microbiological production of carotenoids, as for zeaxanthin, astaxanthin, lutein or β-cryptoxanthin.

Carotenoids are organic pigments ranging in color from yellow to red that are naturally produced by certain organisms, including photosynthetic organisms (e.g., plants, algae, cyanobacteria), and some fungi.

Carotenoids such as lutein, zeaxanthin or astaxanthin are important additives in the human and livestock diet as pigmenting substances and precursors of vitamin A derivatives. In addition, carotenoids have a health-promoting action such as enhancing the immune response and, by reason of their antioxidant properties, a cancer-preventing action, which makes their use as nutraceuticals of interest. An economic process for preparing carotenoids and foodstuffs with an increased carotenoid content is therefore of great importance. Particularly economic processes for preparing carotenoids are biotechnological processes which make use of proteins and biosynthesis genes of carotenoid biosynthesis from carotenoid-producing organisms.

SUMMARY OF THE INVENTION

The present invention is related to proteins or polypeptides, comprising the amino acid sequence SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:14 respectively or a sequence derived from these sequences by substitution, insertion or deletion of amino acids and having a homology of at least 50% at the amino acid level with the sequence SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:14.

The inventors have surprisingly found that the incorporation of a gene encoding an acetyl transferase according to the invention into a host cell able to produce a specific carotenoid containing at least one hydroxyl group, as for example zeaxanthin or astaxanthin, alters carotenoid profiles in the host cells such that acetylated forms of the carotenoid are produced which allows an increased/enhanced accumulation of carotenoid compounds (acetylated plus non-acetylated forms) in the cell compared to strains not transformed with the gene encoding the protein or polypeptide according to the invention.

The present invention also relates to isolated polynucleotides encoding the polypeptides of the present invention, nucleic acid constructs, recombinant expression vectors, and recombinant host cells comprising the polynucleotides, and to methods of producing the polypeptides.

The present invention also provides improved systems, in particular transformed microorganisms for the biological production of carotenoids in which at least one heterologous polypeptide having acetyl transferase activity is expressed.

In one preferred example, the invention provides oleaginous fungi (including, for example, yeast) that produce one or more carotenoids. The present invention also provides methods of constructing such yeast and fungi, methods of using such yeast and fungi to produce carotenoids, and methods of preparing carotenoid-containing compositions, such as food or feed additives, or nutritional supplements, using carotenoids produced in such oleaginous yeast or fungi. In particular, the present invention provides systems and methods for generating yeast and fungi containing polynucleotides encoding the polypeptides of the present invention.

OVERVIEW OF SEQUENCE LISTING

SEQ ID NO:1 is the non-optimized DNA sequence encoding acetyl transferase ATF1 from S. cerevisiae.

SEQ ID NO:2 is the non-optimized DNA sequence encoding acetyl transferase ATF1 from S. bayanus.

SEQ ID NO:3 is the non-optimized DNA sequence encoding acetyl transferase ATF1 from S. mikatae.

SEQ ID NO:4 is the non-optimized DNA sequence of acetyl transferase ATF1 from S. kudriavzevii.

SEQ ID NO:5 is the amino acid sequence as deduced from SEQ ID NO:1.

SEQ ID NO:6 is the amino acid sequence as deduced from SEQ ID NO:2.

SEQ ID NO:7 is the amino acid sequence as deduced from SEQ ID NO:3.

SEQ ID NO:8 is the amino acid sequence as deduced from SEQ ID NO:4.

SEQ ID NO:9 is the DNA sequence encoding acetyl transferase ATF1 from S. cerevisiae as optimized for expression in Yarrowia lipolytica.

SEQ ID NO:10 is the DNA sequence encoding acetyl transferase ATF1 from S. bayanus as optimized for expression in Yarrowia lipolytica.

SEQ ID NO:11 is the DNA sequence encoding acetyl transferase ATF1 from S. mikatae as optimized for expression in Yarrowia lipolytica.

SEQ ID NO:12 is the DNA sequence encoding acetyl transferase ATF1 from S. kudriavzevii as optimized for expression in Yarrowia lipolytica.

SEQ ID NO:13 is the non-optimized DNA sequence encoding acetyl transferase ATF1 from S. arboricolus.

SEQ ID NO:14 is amino acid sequence as deduced from SEQ ID NO:13.

SEQ ID NO:15 is the DNA sequence encoding acetyl transferase ATF1 from S. arboricolus as optimized for expression in Yarrowia lipolytica.

SEQ ID NO:16 is the DNA sequence encoding acetyl transferase ATF1 from S. bayanus as optimized for expression in Paracoccus zeaxanthinifaciens using P. denitrificans PD1222 codon usage table.

SEQ ID NO:17 is the DNA sequence encoding acetyl transferase ATF1 from S. cerevisiae as optimized for expression in Paracoccus zeaxanthinifaciens using P. denitrificans PD1222 codon usage table.

SEQ ID NO:18: is the non-optimized DNA sequence of acetyl transferase ATF1 from S. cerevisiae with an internal NdeI site removed.

DEFINITIONS

Isolated polypeptide: The term “isolated polypeptide” means a polypeptide that is modified by the hand of man relative to that polypeptide as found in nature. In one aspect, the polypeptide is at least 1% pure, e.g., at least 5% pure, at least 10% pure, at least 20% pure, at least 40% pure, at least 60% pure, at least 80% pure, and at least 90% pure, as determined by SDS-PAGE.

Substantially pure polypeptide: The term “substantially pure polypeptide” means a preparation that contains at most 10%, at most 8%, at most 6%, at most 5%, at most 4%, at most 3%, at most 2%, at most 1%, and at most 0.5% by weight of other polypeptide material with which it is natively or recombinantly associated. Preferably, the polypeptide is at least 92% pure, e.g., at least 94% pure, at least 95% pure, at least 96% pure, at least 97% pure, at least 98% pure, at least 99% pure, at least 99.5% pure, and 100% pure by weight of the total polypeptide material present in the preparation. The polypeptides of the present invention are preferably in a substantially pure form. This can be accomplished, for example, by preparing the polypeptide by well known recombinant methods or by classical purification methods.

Sequence Identity: The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter “sequence identity”.

For purposes of the present invention, the degree of sequence identity between two amino acid sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 3.0.0 or later. The optional parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. The output of Needle labeled “longest identity” (obtained using the—nobrief option) is used as the percent identity and is calculated as follows:

(Identical Residues×100)/(Length of Alignment−Total Number of Gaps in Alignment)

For purposes of the present invention, the degree of sequence identity between two deoxyribonucleotide sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 3.0.0 or later. The optional parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix. The output of Needle labeled “longest identity” (obtained using the—nobrief option) is used as the percent identity and is calculated as follows:

(Identical Deoxyribonucleotides×100)/(Length of Alignment−Total Number of Gaps in Alignment)

Fragment: The term “fragment” means a polypeptide having one or more (several) amino acids deleted from the amino and/or carboxyl terminus of a mature polypeptide; wherein the fragment has acetyl transferase activity.

Allelic variant: The term “allelic variant” means any of two or more alternative forms of a gene occupying the same chromosomal locus. Allelic variation arises naturally through mutation, and may result in polymorphism within populations. Gene mutations can be silent (no change in the encoded polypeptide) or may encode polypeptides having altered amino acid sequences. An allelic variant of a polypeptide is a polypeptide encoded by an allelic variant of a gene.

Isolated polynucleotide: The term “isolated polynucleotide” means a polynucleotide that is modified by the hand of man relative to that polynucleotide as found in nature. In one aspect, the isolated polynucleotide is at least 1% pure, e.g., at least 5% pure, at least 10% pure, at least 20% pure, at least 40% pure, at least 60% pure, at least 80% pure, at least 90% pure, and at least 95% pure, as determined by agarose electrophoresis. The polynucleotides may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any combinations thereof.

Substantially pure polynucleotide: The term “substantially pure polynucleotide” means a polynucleotide preparation free of other extraneous or unwanted nucleotides and in a form suitable for use within genetically engineered polypeptide production systems. Thus, a substantially pure polynucleotide contains at most 10%, at most 8%, at most 6%, at most 5%, at most 4%, at most 3%, at most 2%, at most 1%, and at most 0.5% by weight of other polynucleotide material with which it is natively or recombinantly associated. A substantially pure polynucleotide may, however, include naturally occurring 5′ and 3′ untranslated regions, such as promoters and terminators. Preferably, the polynucleotide is at least 90% pure, e.g., at least 92% pure, at least 94% pure, at least 95% pure, at least 96% pure, at least 97% pure, at least 98% pure, at least 99% pure, and at least 99.5% pure by weight. The polynucleotides of the present invention are preferably in a substantially pure form.

Coding sequence: The term “coding sequence” means a polynucleotide which directly specifies the amino acid sequence of a polypeptide. The boundaries of the coding sequence are generally determined by an open reading frame, which usually begins with the ATG start codon or alternative start codons such as GTG and TTG and ends with a stop codon such as TAA, TAG, and TGA. The coding sequence may be a DNA, cDNA, synthetic, or recombinant polynucleotide.

cDNA: The term “cDNA” means a DNA molecule that can be prepared by reverse transcription from a mature, spliced, mRNA molecule obtained from a eukaryotic cell. cDNA lacks intron sequences that may be present in the corresponding genomic DNA. The initial, primary RNA transcript is a precursor to mRNA that is processed through a series of steps, including splicing, before appearing as mature spliced mRNA.

Nucleic acid construct: The term “nucleic acid construct” means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature or which is synthetic. The term nucleic acid construct is synonymous with the term “expression cassette” when the nucleic acid construct contains the control sequences required for expression of a coding sequence of the present invention.

Control sequences: The term “control sequences” means all components necessary for the expression of a polynucleotide encoding a polypeptide of the present invention. Each control sequence may be native or foreign to the polynucleotide encoding the polypeptide or native or foreign to each other. Such control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the polynucleotide encoding a polypeptide.

Operably linked: The term “operably linked” means a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of a polynucleotide such that the control sequence directs the expression of the coding sequence.

Expression: The term “expression” includes any step involved in the production of the polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.

Expression vector: The term “expression vector” means a linear or circular DNA molecule that comprises a polynucleotide encoding a polypeptide and is operably linked to additional nucleotides that provide for its expression.

Host cell: The term “host cell” means any cell type that is susceptible to transformation, transfection, transduction, conjugation and the like with a nucleic acid construct or expression vector comprising a polynucleotide of the present invention. The term “host cell” encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication.

Variant: The term “variant” means a polypeptide having acetyl transferase activity comprising an alteration, i.e., a substitution, insertion, and/or deletion of one or more (several) amino acid residues at one or more (several) positions. A substitution means a replacement of an amino acid occupying a position with a different amino acid; a deletion means removal of an amino acid occupying a position; and an insertion means adding 1-3 amino acids adjacent to an amino acid occupying a position.

DETAILED DESCRIPTION OF THE INVENTION

Acetyl transferases hereinafter mean proteins or enzymes according to the invention which transfer an acetyl group to a carotenoid or carotenoid derivative containing at least one hydroxyl group, for example to zeaxanthin or astaxanthin, comprising the amino acid sequence SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:14 or a sequence derived from these sequences by substitution, insertion or deletion of amino acids and having a homology of at least 50% at the amino acid level with the sequence SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:14.

The amino acid sequence depicted in SEQ ID NO:5 is derived from translation of the cDNA sequence depicted in SEQ ID NO:1, the amino acid sequence depicted in SEQ ID NO:6 is derived from translation of the cDNA sequence depicted in SEQ ID NO:2, the amino acid sequence depicted in SEQ ID NO:7 is derived from translation of the cDNA sequence depicted in SEQ ID NO:3, the amino acid sequence depicted in SEQ ID NO:8 is derived from translation of the cDNA sequence depicted in SEQ ID NO:4, and the amino acid sequence depicted in SEQ ID NO:14 is derived from translation of the cDNA sequence depicted in SEQ ID NO:13.

Substitution means replacement of one or more amino acids by one or more amino acids. The replacements are preferably those called conservative, in which the replaced amino acid has a similar property to the original amino acid, for example replacement of Glu by Asp, Gln via Asn, Val by Ile, Leu by Ile, Ser by Thr.

Deletion is the replacement of an amino acid by a direct linkage. Preferred positions for deletions are the termini of the polypeptide and the linkages between the individual protein domains.

Insertions are introductions of amino acids into the polypeptide chain, there formally being replacement of a direct linkage by one or more amino acids.

The homology between two proteins means identity of the amino acids over the entire length of each protein, which is calculated by comparison with the aid of the computer program GAP (UWGCG, University of Wisconsin, Genetic Computer Group, program algorithm of Needleman and Wunsch (J. Mol. Biol. 1970, 48: 443-453), setting the following parameters:

-   -   Gap Weight: 12     -   Length Weight: 4     -   Average Match: 2.912     -   Average Mismatch: −2.003

A protein which has a homology of at least 50% at the amino acid level with the sequence SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:14 means a protein which, in comparison of its sequence with the sequence SEQ ID NO:5, SEQ ID NO:6, SEQ

ID NO:7, SEQ ID NO:8 or SEQ ID NO:14 using the above program algorithm with the above set of parameters, has an identity of at least 50%, preferably 60%, particularly preferably 70%.

The acetyl transferases can be prepared, as described hereinafter, by gene expression of the appropriate nucleic acids which encode these proteins from natural or genetically manipulated organisms.

The invention further relates to a process for transferring an acetyl group to a carotenoid or carotenoid derivative containing at least one hydroxyl group such as, for example, zeaxanthin, [beta]-cryptoxanthin, 3′-hydroxyechinenone, 3-hydroxyechinenone, adonixanthin (4-ketozeaxanthin), astaxanthin, phoenicoxanthin (adonirubin), [alpha]-cryptoxanthin or lutein or derivatives thereof having up to 40 C atoms. Preferably, such carotenoids or carotenoid derivatives contain at least one 3-hydroxy-[beta]-ionone or at least one 3-hydroxy-4-keto-[beta]-ionone or at least one 3-hydroxy-[epsilon]-ionone or at least one 3-hydroxy-4-keto-[epsilon]-ionone structural element in the molecule, such as, for example, 3-hydroxy-6-vinyl-[beta]-ionone, 3-hydroxy-4-keto-6-vinyl-[beta]-ionone, 3-hydroxyretinol, 3-hydroxy-4-ketoretinol, 3-hydroxyretinal, 3-hydroxy-4-ketoretinal, 3-hydroxyretinoic acid, 3-hydroxy-4-ketoretinoic acid or lutein.

The invention also relates to nucleic acid sequences coding for one of the acetyl transferases according to the invention. A preferred nucleic acid has the sequence SEQ ID NO:1, SQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 or SEQ ID NO:13.

The invention moreover relates to functional analogs of the nucleic acids according to sequence SEQ ID NO:1, SQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 or SEQ ID NO:13, obtained by addition, substitution, insertion and/or deletion of individual or multiple nucleotides, which furthermore code for an acetyl transferase having the desired specificity.

The invention also encompasses those nucleic acid sequences which comprise so-called silent mutations or which are modified in comparison with a specifically mentioned sequence in accordance with the codon usage of a specific origin or host organism, and naturally occurring variants of such nucleic acid sequences.

The invention also encompasses modifications of the nucleic acid sequences obtained by degeneration of the genetic code (i.e. without any changes in the corresponding amino acid sequence) or conservative nucleotide substitution (i.e. the corresponding amino acid is replaced by another amino acid of the same charge, size, polarity and/or solubility), and sequences modified by nucleotide addition, insertion, inversion or deletion, which sequences encode an acetyl transferase according to the invention having a “modified substrate profile”, and the corresponding complementary sequences.

The invention furthermore relates to expression constructs comprising a nucleic acid sequence according to the invention under the genetic control of regulatory nucleic acid sequences; and vectors comprising at least one of these expression constructs.

The invention also relates to a recombinant nucleic acid molecule. “Recombinant nucleic acid molecule” primarily refers to a nucleic acid molecule or nucleic acid sequence comprises nucleic acid molecules from two or more different genetic sources. According to the present invention, the recombinant nucleic acid molecule can be an expression construct, i.e., nucleic acid sequence encoding the polypeptide having acetyl transferase according to the invention, operatively linked to an expression control sequence, or the same nucleic acid sequence being integrated into the host chromosome.

Preferably, the constructs according to the invention encompass a promoter 5′-upstream of the encoding sequence in question and a terminator sequence 3′-downstream, and, optionally, further customary regulatory elements, and, in each case operatively linked with the encoding sequence. Operative linkage is to be understood as meaning the sequential arrangement of promoter, encoding sequence, terminator and, if appropriate, other regulatory elements in such a manner that each of the regulatory elements can fulfill its intended function on expression of the encoding sequence. Examples of operatively linkable sequences are targeting sequences, or else translation enhancers, enhancers, polyadenylation signals and the like. Further regulatory elements encompass selectable markers, amplification signals, replication origins and the like.

In addition to the artificial regulatory sequences, the natural regulatory sequence can still be present upstream of the actual structural gene. If desired, this natural regulation may be switched off by genetic modification, and the expression of the genes may be enhanced or lowered. However, the gene construct may also be simpler in construction, i.e. no additional regulatory signals are inserted upstream of the structural gene and the natural promoter with its regulation is not removed. Instead, the natural regulatory sequence is mutated in such a way that regulation no longer takes place and the gene expression is increased or reduced. One or more copies of the nucleic acid sequences may be present in the gene construct.

Examples of suitable promoters are: cos, tac, trp, tet, trp-tet, Ipp, lac, Ipp-lac, laclq, T7, T5, T3, gal, trc, ara, SP6, I-PR or I-PL promoter, which are advantageously employed in Gram-negative bacteria; and Gram-positive promoters amy and SPO2, the yeast promoters ADC1, MFa, Ac, P-60, CYC1, GAPDH, TEF1 or the plant promoters CaMV/35S, SSU, OCS, lib4, usp, STLS1, B33, nos or the ubiquitin or phaseolin promoter. Particular preference is given to using inducible promoters, for example light- and in particular temperature-inducible promoters, such as the PrP1 promoter.

In principle, all natural promoters with their regulatory sequences can be used. In addition, synthetic promoters may also be used in an advantageous fashion.

The above mentioned regulatory sequences are intended to allow the targeted expression of the nucleic acid sequences and of protein expression. Depending on the host organism, this may mean, for example, that the gene is expressed or over expressed only after induction has taken place, or that it is expressed and/or over expressed immediately and/or constitutively.

The regulatory sequences or factors can preferably have a positive effect on expression and in this manner increase or reduce the latter. Thus, an enhancement of the regulatory elements may advantageously take place at the transcriptional level by using strong transcription signals such as promoters and/or “enhancers”. In addition, translation may also be enhanced by improving, for example, mRNA stability.

An expression cassette is generated by fusing a suitable promoter with a suitable acetyl transferase nucleotide sequence and a terminator signal or polyadenylation signal. To this end, customary recombination and cloning techniques are used as they are described, for example, in T. Maniatis, E. F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989) and in T. J. Silhavy, M. L. Berman and L. W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1984) and in Ausubel, F. M. et al., Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley Interscience (1987).

For expression in a suitable host organism, the recombinant nucleic acid construct or gene construct is advantageously inserted into a host-specific vector which allows optimal gene expression in the host. Vectors are well known to the skilled worker and can be found, for example, in “Cloning Vectors” (Pouwels P. H. et al., Ed., Elsevier, Amsterdam-New York-Oxford, 1985). Vectors are to be understood as meaning not only plasmids, but all other vectors known to the skilled worker such as, for example, phages, viruses, such as SV40, CMV, baculovirus and adenovirus, transposons, IS elements, plasmids, cosmids, and linear or circular DNA. These vectors can be replicated autonomously in the host organism or chromosomally.

The vectors according to the invention allow the generation of recombinant microorganisms which are transformed, for example, with at least one vector according to the invention and which can be employed for producing the mutants. The above-described recombinant constructs according to the invention are advantageously introduced into a suitable host organism and expressed. It is preferred to use usual cloning and transfection methods known to the skilled worker in order to bring about expression of the abovementioned nucleic acids in the expression system in question. Suitable systems are described, for example, in current protocols in molecular biology, F. Ausubel et al., Ed., Wiley Interscience, New York 1997.

Suitable host organisms are, in principle, all organisms which allow expression of the nucleic acids according to the invention, their allelic variants, and their functional equivalents or derivatives. Preferred initial organisms are those naturally able to synthesize carotenoids. However, initial organisms able to synthesize carotenoids because of the introduction of carotenoid biosynthesis genes are also suitable. Initial organisms mean prokaryotic or eukaryotic organisms such as, for example, microorganisms or plants. Preferred microorganisms are bacteria, yeasts, algae or fungi.

Therefore, the invention further relates to a process for preparing the genetically modified organisms described below, wherein the acetyl transferase genes according to the invention are introduced into the genome of the initial organism. By initial organisms are meant the organisms before the genetic modification according to the invention.

The acetyl transferase genes according to the invention can in principle be introduced by all methods known to the skilled worker into the initial organisms described below, which are genetically modified thereby.

They are advantageously introduced into the initial organisms or cells thereof by transformation, transfection, conjugation, electroporation, using the so-called particle gun, or by microinjection.

The skilled worker can find appropriate methods for microorganisms in the textbooks by Sambrook, J. et al. (1989) Molecular cloning: A laboratory manual, Cold Spring Harbor Laboratory Press, by F. M. Ausubel et al. (1994) Current protocols in molecular biology, John Wiley and Sons, by D. M. Glover et al., DNA Cloning Vol. 1, (1995), IRL Press (ISBN 019-963476-9), by Kaiser et al. (1994) Methods in Yeast Genetics, Cold Spring Harbor Laboratory Press or Guthrie et al. Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, 1994, Academic Press.

Examples of advantageous methods which may be mentioned are those such as the introduction of the DNA by homologous or heterologous recombination, for example using the URA3 gene, specifically the URA3 gene from Ashbya, as described in the German Application DE 19801120.2, and/or by the REMI method (=“restriction enzyme mediated integration”) which is described below.

The REMI technique is based on the cotransformation of a linear DNA construct which has been cut at both ends with the same restriction endonuclease, together with the restriction endonuclease which was used for this restriction of the DNA construct, into an organism. The restriction endonuclease then cuts the genomic DNA of the organism into which the DNA construct has been introduced together with the restriction enzyme. This leads to an activation of the cell's own repair mechanisms. These repair mechanisms repair the strand breaks in the genomic DNA which have been caused by endonuclease, and during this also incorporate with a certain frequency the cotransformed DNA construct into the genome. Ordinarily, the restriction cleavage sites are retained at both ends of the DNA during this.

This technique was described by Bolker et al. (Mol. Gen. Genet. 1995, 248: 547-552) for the insertion mutagenesis of fungi. The method was used by Von Schiestl and Petes (Proc. Natl. Acad. Sci. USA, 1991, 88: 7585-7589) to find out whether there is heterologous recombination in Saccharomyces. The method has been described by Brown et al. (Mol. Gen. Genet. 1996, 251: 75-80) for the stable transformation and regulated expression of an inducible reporter gene.

It is possible using the REMI method to position the nucleic acid fragments according to the invention or the aforementioned acetyl transferase genes according to the invention at transcriptionally active sites in the genome.

It is possible and advantageous to clone the nucleic acids together with at least one reporter gene into a DNA construct, which is introduced into the genome. This reporter gene ought to make detectability easy by a growth, fluorescence, chemo- or bioluminesence assay or by a photometric measurement. Examples which may be mentioned of reporter genes are antibiotic resistance genes, hydrolase genes, fluorescent protein genes, bioluminescence genes, glucosidase genes, the luciferase gene, [beta]-galactosidase gene, gfp gene, lipase gene, esterase gene, peroxidase gene, [beta]-lactamase gene, acetyl-, phospho- or adenyltransferase gene. These genes make it possible easily to measure and quantify the transcription activity and thus the expression of the genes. This means that it is possible to identify sites in the genome which have a productivity differing by up to a factor of 2.

If it is intended to introduce a plurality of genes, such as, for example, further genes of carotenoid biosynthesis, into the organism, they can all be introduced together with a reporter gene in a single vector, or each individual gene with a reporter gene can be introduced in one vector in each case, into the organism, it being possible to introduce the various vectors at the same time or successively. It is also possible to insert gene fragments coding for the respective activities using the REMI techniques.

Restriction enzymes suitable in principle for integrating the acetyl transferase genes or nucleic acid constructs according to the invention into the genome of initial organisms are all known to the person skilled in the art. Restriction enzymes which recognize only 4 base pairs as restriction cleavage site are less preferred because they cut too often in the genome or in the vector to be integrated, and preferred enzymes recognize 6, 7, 8 or more base pairs as cleavage site, such as BamHI, EcoRI, BgilII, SphI, SpeI, XbaI, XhoI, NcoI, SalI, ClaI, KpnI, Hind/III, SacI, PstI, BpnI, NotI, SrfI or SfiI, to mention only a few of the possible enzymes. It is advantageous if the enzymes used no longer have cleavage sites in the DNA to be introduced; this increases the efficiency of integration. Ordinarily, 5 to 500 U, preferably 10 to 250, particularly preferably 10 to 100 U of the enzymes are used in the REMI mixture. The enzymes are advantageously employed in an aqueous solution which contains substances for osmotic stabilization, such as sugars such as sucrose, trehalose or glucose, polyols such as glycerol or polyethylene glycol, a buffer with an advantageous buffering in the range of pH 5 to 9, preferably 6 to 8, particularly preferably 7 to 8, such as tris, MOPS, HEPES, MES or PIPES and/or substances to stabilize the nucleic acids, such as inorganic or organic salts of Mg, Cu, Co, Fe, Mn or Mo. It is also possible where appropriate for other substances to be present, such as EDTA, EDDA, DTT, [beta]-mercaptoethanol or nuclease inhibitors. However, it is also possible to carry out the REMI technique without these additions.

The process is carried out at a temperature in the range from 5 to 80° C., preferably from 10 to 60° C., particularly preferably from 20 to 40° C. Other known methods for destabilizing cell membranes are suitable for the process, such as, for example, electroporation, fusion with loaded vesicles or destabilization with various alkali metal or alkaline earth metal salts such as lithium, rubidium or calcium salts, with lithium salts being preferred.

The invention further relates to a correspondingly genetically modified organism, with the expression of the acetyl transferase genes according to the invention being increased by comparison with a wild type organism in the case where the initial organism contains an acetyl transferase gene, or being caused in the case where the initial organism does not contain an acetyl transferase gene, by the genetic modification.

A genetically modified organism means an organism in which the acetyl transferase gene(s) or nucleic acid construct(s) according to the invention have been inserted, preferably by one of the methods described above.

The genetically modified organism contains at least one acetyl transferase gene according to the invention or at least one nucleic acid construct according to the invention. Depending on the initial organism, the nucleic acid may be present inside or outside the chromosome.

Carotenoid metabolism in the genetically modified organisms is preferably altered by comparison with the wild type.

Preferred organisms are recombinant bacteria, plants, fungi or yeast. In a particular embodiment, the recombinant fungus is oleaginous in that it can accumulate lipid to at least about 20% of its dry cell weight; and produces at least one carotenoid selected from the group consisting of antheraxanthin, adonirubin, adonixanthin, astaxanthin, canthaxanthin, capsorubrin, β-cryptoxanthin, α-carotene, δ-carotene, ε-carotene, echinenone, 3-hydroxyechinenone, 3′-hydroxyechinenone, γ-carotene, Ψ-carotene, 4-keto-γ-carotene, ζ-carotene, α-cryptoxanthin, β-cryptoxanthin, deoxyflexixanthin, diatoxanthin, 7,8-didehydroastaxanthin, didehydrolycopene, fucoxanthin, fucoxanthinol, isorenieratene, β-isorenieratene, lactucaxanthin, lutein, lycopene, myxobactone, neoxanthin, neurosporene, hydroxyneurosporene, peridinin, phytoene, rhodopin, rhodopin glucoside, 4-keto-rubixanthin, siphonaxanthin, spheroidene, spheroidenone, spirilloxanthin, torulene, 4-keto-torulene, 3-hydroxy-4-keto-torulene, uriolide, uriolide acetate, violaxanthin, zeaxanthin-β-diglucoside, zeaxanthin, a C30 carotenoid, and combinations thereof, and can accumulate the produced carotenoid to at least about 1% of its dry cell weight. Even more preferably, the recombinant fungus is a member of a genus selected from the group consisting of: Aspergillus, Blakeslea, Botrytis, Candida, Cercospora, Cryptococcus, Cunninghamella, Fusarium (Gibberella), Kluyveromyces, Lipomyces, Mortierella, Mucor, Neurospora, Penicillium, Phycomyces, Pichia (Hansenula), Puccinia, Pythium, Rhodosporidium, Rhodotorula, Saccharomyces,

Sclerotium, Trichoderma, Trichosporon, Xanthophyllomyces (Phaffia), and Yarrowia, or is of a species selected from the group consisting of: Aspergillus terreus, Aspergillus nidulans, Aspergillus niger, Blakeslea trispora, Botrytis cinerea, Candida japonica, Candida pulcherrima, Candida revkaufi, Candida tropicalis, Candida utilis, Cercospora nicotianae, Cryptococcus curvatus, Cunninghamella echinulata, Cunninghamella elegans, Fusarium fujikuroi (Gibberella zeae), Kluyveromyces lactis, Lipomyces starkeyi, Lipomyces lipoferus, Mortierella alpina, Mortierella ramanniana, Mortierella isabellina, Mortierella vinacea, Mucor circinelloides, Neurospora crassa, Phycomyces blakesleanus, Pichia pastoris, Puccinia distincta, Pythium irregulare, Rhodosporidium toruloides, Rhodotorula glutinis, Rhodotorula graminis, Rhodotorula mucilaginosa, Rhodotorula pinico/a, Rhodotorula gracilis, Saccharomyces cerevisiae, Sclerotium rolfsii, Trichoderma reesei, Trichosporon cutaneum, Trichosporon pullulans, Xanthophyllomyces dendrorhous (Phaffia rhodozyma), and Yarrowia lipolytica.

Of these naturally oleaginous strains, some also naturally produce carotenoids and some do not; these strains may be additionally utilized as a host cell by introduction of carotenoid biosynthesis genes as disclosed in U.S. Pat. No. 7,851,199.

In a particular embodiment, the recombinant bacterium is gram negative or positive. Gram-positive bacterial hosts include, but are not limited to, Bacillus, Brevibacillus, Clostridium, Geobacillus, Lactobacillus, Lactococcus, Paenibacillus, and Streptomyces. Gram-negative bacteria include, but are not limited to E. coli, Pseudomonas and Paracoccus. The recombinant bacterial host may be any Bacillales including, but not limited to, Bacillus amyloliquefaciens, Brevibacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus lentus, Bacillus licheniformis, Geobacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis. The recombinant bacterial host may also be any Streptomyces including, but not limited to, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans. The recombinant bacterial host may also be any Paracoccus including, but not limited to Paracoccus denitrificans, Paracoccus versutus, Paracoccus carotinifaciens, Paracoccus marcusii, and Paracoccus zeaxanthinifaciens.The recombinant bacterium produces at least one carotenoid selected from the group consisting of antheraxanthin, adonirubin, adonixanthin, astaxanthin, canthaxanthin, capsorubrin, β-cryptoxanthin, α-carotene, δ-carotene, ε-carotene, echinenone, 3-hydroxyechinenone, 3′-hydroxyechinenone, γ-carotene, ψ-carotene, 4-keto-γ-carotene, ζ-carotene, α-cryptoxanthin, deoxyflexixanthin, diatoxanthin, 7,8-didehydroastaxanthin, didehydrolycopene, fucoxanthin, fucoxanthinol, isorenieratene, β-isorenieratene, lactucaxanthin, lutein, lycopene, myxobactone, neoxanthin, neurosporene, hydroxyneurosporene, peridinin, phytoene, rhodopin, rhodopin glucoside, 4-keto-rubixanthin, siphonaxanthin, spheroidene, spheroidenone, spirilloxanthin, torulene, 4-keto-torulene, 3-hydroxy-4-keto-torulene, uriolide, uriolide acetate, violaxanthin, zeaxanthin-β-diglucoside, zeaxanthin, a C30 carotenoid, and combinations thereof.

In other embodiments, the present invention provides a method of producing a carotenoid, the method comprising steps of cultivating a fungus or bacterium under conditions that allow production of the carotenoid; and isolating the produced carotenoid.

Cultivation of the genetically modified organism according to the invention takes place in a manner known per se, such as cultivation of the appropriate wild type, for example in the case of microorganisms in a suitable medium such as, for example, on agar plates or in suspension culture, or in the case of plants in soil or appropriately suitable nutrient media. By harvesting is meant in the case of microorganisms the isolation of the microorganisms, and in the case of plants the cutting off of the plant or, where appropriate, particular plant parts containing the carotenoids. The carotenoids are isolated in a manner known per se, for example by disruption of the organism cells, extraction of the carotenoids and subsequent purification of the carotenoids by chemical or physical separation methods such as extraction or chromatography.

The following examples illustrate the invention.

EXAMPLES

Table 1 below describes certain Yarrowia lipolytica strains used in the following exemplification:

TABLE 1 Yarrowia lipolytica strains. Strain Number Genotype How Constructed ML9863 MATB erg9-4789::ura3 Classical and standard {HMG-tr GGS carB molecular genetic carRP crtW Xa-crtZ techniques Dc-crtZ} prototrophic ML9335 MATA erg9-4789::ura3 Classical and standard {HMG-tr GGS carB molecular genetic carRP crtW Dc-crtZ} techniques prototrophic ML12526 ML9335 plus extra copies Untargeted transformations of HMG-tr carB 3X-carRP followed by removal of Hyg^(R) and Nat^(R) using cre-lox system ML11218 ML9863 crtW-Δ6180 Targeted disruption with Hyg^(R) cassette; subsequent marker removal using cre-lox system

Yarrowia strains ML9863 and ML9335 were constructed by the introduction of heterologous genes under the control of constitutive promoters, coupled with several generations of crossbreeding, starting with ML350 and ATCC201249 as described in U.S. Pat. No. 7,851,199. The GGS gene and the truncated HMG gene (“HMG-tr”) were derived from Yarrowia sequences corresponding to native geranylgeranyl pyrophosphate synthase and hydroxymethylglutaryl-CoA reductase genes, respectively. The carRP and carB genes were derived from Mucor circinelloides, and they encode a bifunctional phytoene synthase/lycopene cyclase and a phytoene dehydrogenase, respectively. The crtW gene was synthesized to encode the carotene ketolase of Parvularcula bermudensis. The crtZ gene was amplified from Xanthobacter autotrophicus (Xa), or synthesized to encode the carotene hydroxylase of Cronobacter pulveris (formerly known as Enterobacter pulveris) (Ep) or Enterobacteriaceae bacterium DC404 (Dc). These genes are sometimes but not always associated with auxotrophic markers (URA3, LEU2, URA2, LYS1, ADE1) or a loxP site, remnant of a Hyg^(R) or Nat^(R) marker.

TABLE 2 Plasmids. Plasmid Backbone Insert Source pMB6532 pMB6157 (Hyg^(R)) Sc-ATF1 Synthesized NheI - MluI fragment pMB6563 pMB6532 Sc-ATF1 (2^(nd) copy) Synthesized NheI - MluI fragment pMB6608 pMB6563 Nat^(R) pMB6200 pMB6732 pMB6157 Sb-ATF1 Synthesized NheI - MluI fragment pMB6733 pMB6157 Sk-ATF1 Synthesized NheI - MluI fragment pMB6812 pMB6157 Sa-ATF1 Synthesized NheI - MluI fragment pMB6655 pMB6157 Hyg^(R) pMB6674 pMB6157 Hyg^(R) pMB6769 pMB6655 Sb-ATF1 Synthesized NheI - MluI fragment pMB6771 pMB6674 Sb-ATF1 Synthesized NheI - MluI fragment pMB6832 pMB6771 (Hyg^(R)) Sb-ATF1 - Sb-ATF1 pMB6732 pMB7006 pRK415 Tet^(R) Ref paper? pMB6976 pRK-PcrtE-crtE Sb ATF1 (wild type) Synthesized NdeI - HindIII fragment pMB6977 pRK-PcrtE-crtE Sc-ATF1 (wild type with Synthesized NdeI - HindIII NdeI site in ORF removed) fragment pMB6978 pRK-PcrtE-crtE Sb-ATF1 Synthesized NdeI - HindIII fragment pMB6979 pRK-PcrtE-crtE Sc-ATF1 Synthesized NdeI - HindIII fragment

All basic molecular biology and DNA manipulation procedures described herein are generally performed according to Sambrook et al. or Ausubel et al. (J. Sambrook, E. F. Fritsch, T. Maniatis (eds). 1989. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press: New York; F. M. Ausubel, R. Brent, R. E. Kingston, D. D. Moore, J. G. Seidman, J. A. Smith, K. Struhl (eds.). 1998. Current Protocols in Molecular Biology. Wiley: New York).

Example 1a Production of pMB6532 , Encoding S. Cerevisiae Acetyltransferase ATF1.

The ATF1 gene of Saccharomyces cerevisiae was codon optimized according to Yarrowia codon bias, and the DNA fragment specified in SEQ ID No: 9 was synthesized de nova During the de novo synthesis the sequence 5′-TGCTAGCCACAAAA, containing an NheI restriction site and a typical Kozak sequence for enabling efficient translation, was added immediately upstream of the ATG. The sequence ACGCGT-3′, comprising an MluI restriction site, was added immediately downstream of the stop codon. This sequence was cleaved using NheI and MluI and ligated to pMB6157 cut with NheI and MluI to produce pMB6532. The resulting protein encoded by the Sc-ATF1 gene of pMB6532 is specified in SEQ ID No: 5. This plasmid was subsequently cleaved with EcoRV, and the Sc-ATF1-containing cassette duplicated by the insertion of a 2.35 kb SspI-PvuII fragment from the same plasmid, to create pMB6563, which encodes converging transcripts of Sc-ATF1 flanking a Hyg^(R) marker. The Nat^(R) marker, conferring nourseothricin resistance, was used to replace Hyg^(R) in this plasmid by cleaving pMB6563 with SspI and BamHI and ligating it to the 1.3 kb SspI-BamHI fragment from pMB6200, to create pMB6608.

Example 1b Introduction of S. Cerevisiae Acetyltransferase ATF1 into a Yarrowia Strain Capable of Producing Zeaxanthin

Strain ML11218 was transformed with a PvuII fragment of MB6563 comprising two copies of Sc-ATF1 under the control of constitutive promoters and a selectable marker for hygromycin resistance, Hyg^(R). Ten hygromycin resistant transformants were chosen from the transformation plate (YPD+100 mg/L hygromycin) after 3-4 days of growth at 30° C. Most transformants produced between 14 and 17% of mono-acetylated Zeaxanthin and between 42 and 57% of di-acetylated zeaxanthin (as a percentage of total zeaxanthin) when grown in YPD for 4 days at 30° C. Free zeaxanthin production ranged between 27 and 42% of total zeaxanthin. One strain, ML12641, was chosen for further engineering.

Strain ML12641 was transformed with XbaI-treated MB6608, comprising two copies of Sc-ATF1 under the control of constitutive promoters and a selectable marker for nourseothricin, Nat^(R). Ten nourseothricin resistant transformants were chosen from the transformation plate (YPD+100 mg/L nourseothricin) after 3-4 days of growth at 30° C. Several transformants produced between 9 and 16% mono-acylated and between 56 and 80% di-acetylated zeaxanthin (as a percentage of total zeaxanthin) when grown in YPD for 4 days at 30° C. Free zeaxanthin production ranged between 11 and 28% of total zeaxanthin. One strain, ML12735, was chosen for further analysis and for cultivation in a fed-batch fermentor.

Strain ML12735 was grown in a fermentor using a fed-batch process. Total (free plus esterified) zeaxanthin production increased two-fold when compared to strain ML11218, which does not carry any copies of the Sc-ATF1 gene. Additionally, FIG. 1 illustrates that zeaxanthin production in strain ML11218 increases sharply early in the fermentation, then plateaus as zeaxanthin production ceases. In contrast, even though strain ML12735 has a similar production profile early in the fermentation process, zeaxanthin production continues to increase past the limit reached by ML11218, and the fermentation maintains its productivity for an extended period, unlike strain ML11218.

Example 1c Introduction of S. Cerevisiae Acetyltransferase ATF1 into a Yarrowia Strain Capable of Producing Astaxanthin

Strain ML12526 was transformed with MB6563 (which contains converging transcripts of ATF1 flanking a Hyg^(R) marker) that had been treated with XbaI. Five hygromycin resistant transformants were chosen from the transformation plate (YPD+100 mg/L hygromycin) after 3-4 days of growth at 30° C. After subsequent growth for 3-4 days in YPD shake flasks at 30° C., transformants produced between 22 and 29% of mono-acetylated and between 16 and 56% of di-acetylated astaxanthin (as a percentage of total astaxanthin). Free astaxanthin production ranged between 19 and 59% of total astaxanthin. One strain, ML12707, was chosen for further analysis and for cultivation in a fed-batch fermentor.

Strain ML12707 was grown in a fermentor using a fed-batch process. FIG. 2 illustrates that the rate of astaxanthin production in strain ML12707 is higher earlier in the fermentation and reaches a higher titer at the end of the run as compared to strain ML12562, which does not carry any copies of the Sc-ATF1 gene.

Example 2a Production of pMB6732, pMB6733, and pMB6812, Encoding S. Bavanus, S. Kudriavzevii, and S. Arboricolus Acetyltransferase ATF1, respectively.

Plasmids were generated for the expression of ATF1 genes from different Saccharomyces species as described in Table 2. The ATF1 genes of Saccharomyces bayanus (Sb), Saccharomyces kudriavzevii (Sk), and Saccharomyces arboricolus (Sk) were codon optimized according to Yarrowia codon bias, and the DNA fragments specified in SEQ ID No: 10, 12, and 15, respectively, were synthesized de nova During the de novo synthesis of the different ATF1 genes the sequence 5′-TGCTAGCCACAAAA, containing an NheI restriction site and a typical Kozak sequence for enabling efficient translation, was added immediately upstream of the ATG. The sequence ACGCGT-3′, comprising an MluI restriction site, was added immediately downstream of the stop codon. The sequences were cleaved using NheI and MluI and ligated to pMB6157 cut with NheI and MluI to produce pMB6732, pMB6733, and pMB6812, respectively. The resulting proteins encoded by the ATF1 genes of pMB6732, pMB6733, and pMB6812 are specified in SEQ ID No: 6, 8, and 14, respectively.

Example 2b Production of pMB6832, Harboring Two Copies of S. Bavanus Acetyltransferase ATF1

S. bayanus ATF1 was also inserted as described above into two other vectors bearing constitutive promoters (pMB6655 and pMB6674), to create pMB6769 and pMB6771, and the two distinct cassettes were combined, via the transfer of a PvuII-SspI fragment from pMB6769 bearing one cassette into EcoRV-cleaved pMB6771, bearing the other cassette and a hygromycin resistant marker, to yield pMB6832.

Example 2c Introduction of Acetyltransferase ATF1 Genes from S. bavanus, S. kudriavzevii and S. arboricolus into a Yarrowia Strain Capable of Producing Zeaxanthin

Strain ML11218 was independently transformed with MB6732 (Sb-ATF1), MB6733 (Sk-ATF1) and MB6812 (Sa-ATF1) which had been treated with XbaI. Ten hygromycin resistant transformants of each plasmid were chosen from the transformation plate (YPD+100 mg/L hygromycin) after 3-4 days of growth at 30° C. After subsequent growth for 3-4 days in YPD shake flasks at 30° C., transformants were analyzed for zeaxanthin production and compared to the control strain, ML11218. Transformants harboring the S. bayanus ATF1 (pMB6732) produced between 4 and 16% mono-acetylated zeaxanthin and between 46 and 85% di-acetylated zeaxanthin (as a percentage of total zeaxanthin). Free zeaxanthin production ranged between 8 and 38% of total zeaxanthin. One transformant harboring S. kudriavzevii ATF1 (pMB6733) produced about 3% mono-acetylated zeaxanthin and no di-acetylated zeaxanthin (as a percentage of total zeaxanthin). Transformants harboring the S. arboricolus ATF1 (pMB6812) produced between 20 and 22% mono-acetylated zeaxanthin and between 30 and 45% di-acetylated zeaxanthin (as a percentage of total zeaxanthin). Free zeaxanthin production ranged between 34 and 49% of total zeaxanthin.

S. bayanus ATF1 was chosen for further studies and to investigate its acetylation capability in fermentors. Strain ML11218 was transformed with a PvuII fragment of pMB6832, harboring two copies of S. bayanus ATF1. Six hygromycin resistant transformants were chosen from the transformation plate (YPD+100 mg/L hygromycin) after 3-4 days of growth at 30° C. After subsequent growth for 3-4 days in YPD shake flasks at 30° C., transformants produced between 5 and 6% mono-acetylated zeaxanthin and between 83 and 90% di-acetylated zeaxanthin (as a percentage of zeaxanthin). Free zeaxanthin production ranged between 4 and 14% of total zeaxanthin. One strain, ML13129, was chosen for further analysis and for cultivation in a fed-batch fermentor.

Strain ML13129 was grown in a fermentor using a fed-batch process. Total (free plus esterified) zeaxanthin production increased 1.8-fold when compared to strain ML11218, which does not carry any copies of the Sb-ATF1 gene. Additionally, FIG. 3 illustrates that zeaxanthin production in both strains has a similar production profile early in the fermentation, but production in ML11218 plateaus and zeaxanthin production ceases. In contrast, zeaxanthin production continues to increase in strain ML13129 past the limit reached by ML11218 approaching almost double its levels.

Example 2d Introduction of Acetyltransferase ATF1 Genes from S. Bavanus and S. Kudriavzevii into a Yarrowia Strain Capable of Producing Astaxanthin

Strain ML12526 was independently transformed with MB6732 (Sb-ATF1) and MB6733 (Sk-ATF1) which had been treated with XbaI. Ten hygromycin resistant transformants of each plasmid were chosen from the transformation plate (YPD+100 mg/L hygromycin) after 3-4 days of growth at 30° C. After subsequent growth for 3-4 days in YPD shake flasks at 30° C., transformants were analyzed for astaxanthin production and compared to the control strain, ML12526. Transformants harboring the Sb-ATF1 plasmid pMB6732 produced between 16 and 30% mono acetylated astaxanthin and between 57 and 76% di acetylated astaxanthin (as a percentage of total astaxanthin). Free astaxanthin production ranged between 8 and 37% of total astaxanthin. Transformants harboring the Sk-ATF1 plasmid pMB6733 produced between 9 and 24% mono acetylated astaxanthin and between land 6% di acetylated astaxanthin (as a percentage of total astaxanthin). Free astaxanthin production ranged between 70 and 90% of total astaxanthin. One strain, ML12819, harboring the Sb-ATF1 plasmid pMB6732, was chosen for further analysis and for cultivation in a fed-batch fermentor. Strain ML12819 was grown in a fermentor using a fed-batch process. FIG. 4 illustrates that the rate of astaxanthin production in strain ML12819 is similar compared to strain ML12562, which does not carry the Sb-ATF1 gene, but reaches a higher titer at the end of the run.

Example 3a Production of pMB6976, pMB6977, pMB6978 and pMB6979 Encoding Native and Codon Optimized S. bavanus and S. cerevisiae ATF1

S. bayanus and S. cerevisiae ATF1 genes were codon optimized for expression in Paracoccus sp. strain R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114) (Patent US20070202579 Al, U.S. Pat. No. 7,232,679 B2) using the Paracoccous denitrificans PD1222 codon usage table, and the DNA fragments specified in SEQ ID No: 16 and SEQ ID No: 17, respectively, were synthesized de novo. Non codon optimized S. bayanus and S. cerevisiae ATF1 genes specified in the SEQ ID No: 2 and SEQ ID No: 18 were also synthesized de novo. The wild type S. cerevisiae ATF1 gene contains an NdeI site that was removed during de novo synthesis. During the de novo synthesis the sequence 5′-CAT was added immediately upstream of the ATG to create an NdeI restriction site. The sequence AAGCTT-3′, comprising a HindIII restriction site, was added immediately downstream of the stop codon. Both codon optimized and non-codon optimized versions of S. bayanus and S. cerevisiae ATF1 were cloned under the control of the crtE promoter of Paracoccus using the plasmid pRK-PcrtE-crtE (derived from pRK415, Keen, et al., Gene. 1988, 70: 191-197) that had been cleaved with NdeI and HindIII to produce pMB6976 (Sb-ATF1 wild type), pMB6977 (Sc-ATF1 wild type), pMB6978 (Sb-ATF1 codon optimized) and pMB6979 (Sc-ATF1 codon optimized). The resulting protein encoded by the Sb-ATF1 genes in pMB6976 and pMB6978 is specified in SEQ ID No: 6. The resulting protein encoded by the Sc-ATF1 genes in pMB6977 and pMB6979 is specified in SEQ ID No: 5.

Table 3 below describes the E. coli strains used in the following exemplification:

TABLE 3 E. coli strains. Strain Gene introduced How constructed MB7006 none Transformation MB7007 Sb-ATF1 (wild type) Transformation MB7008 Sc-ATF1 (wild type) Transformation MB7009 Sb-ATF1 (codon optimized) Transformation MB70010 Sc-ATF1 (codon optimized) Transformation

Table 4 below describes the Paracoccus strains used in the following exemplification:

TABLE 4 Paracoccus zeaxanthinifaciens strains. How Strain Gene introduced constructed R114/pBBR-K-mev-op-R114-PcrtE- none Conjugation with MB7006 (Rif^(R) crtE_(R114) + pMB7006 and Tet^(R) selection) R114/pBBR-K-mev-op-R114-PcrtE- Sb-ATF1 wild type Conjugation with MB7007 (Rif^(R) crtE_(R114) + pMB7007 and Tet^(R) selection) R114/pBBR-K-mev-op-R114-PcrtE- Sc-ATF1 wild type Conjugation with MB7008 (Rif^(R) crtE_(R114) + pMB7008 and Tet^(R) selection) R114/pBBR-K-mev-op-R114-PcrtE- Sb-ATF1 codon optimized Conjugation with MB7009 (Rif^(R) crtE_(R114) + pMB7009 and Tet^(R) selection) R114/pBBR-K-mev-op-R114-PcrtE- Sc-ATF1 codon optimized Conjugation with MB7010 (Rif^(R) crtE_(R114) + pMB7010 and Tet^(R) selection)

Example 3b Introduction of S. Bavanus and S. Cerevisiae ATF1 into a Paracoccus Strain Capable of Producing Zeaxanthin Via Conjugation

Plasmid pMB6975 (pRK415 control plasmid) and plasmids pMB6976, pMB6977, pMB6978 and pMB6979 harboring the different ATF1 genes were transformed into E. coli strain S17-1 to produce strains MB7706, MB7007, MB7708, MB7709 and MB7710 . E. coli strain S17-1 is a mobilization host containing transfer genes in its chromosome. The vector pRK415 is used as the expression plasmid and harbors the transfer genes necessary to move it into Paracoccus. Plasmids pMB6975-pMB6979 were individually introduced into Paracoccus sp. strain R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114) (Rif^(R)) via conjugation with E. coli strains MB7706-MB7710 and selection on 100 mg/L rifampicin and 2.5 mg/L tetracycline. The Paracoccus exconjugants created were named R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7006, R114/+pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7007, R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7008, R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7009, and R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7010.

Six Paracoccus exconjugants harboring the different ATF1 genes and six exconjugants harboring the control plasmid were selected and grown on F-medium (10 g/l tryptone, 10 g/l yeast extract, 30 g/l NaCl, 10 g/l D-glucose, 5 g/l MgSO4.7H2O, pH 7.0) at 28° C. at 200 rpm for 24 hours and analyzed for carotenoid production. One milliliter of broth was harvested, spun down, and the cell pellet was extracted for carotenoids as described for Yarrowia lipolytica samples. All Paracoccus strains harboring optimized and non-optimized S. bayanus and S. cerevisiae ATF1 genes produced acetylated zeaxanthin and β-cryptoxanthin in addition to free zeaxanthin, β-cryptoxanthin, and carotenes. The control strain, R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7006, without ATF1, does not produce acetylated zeaxanthin or acetylated β-cryptoxanthin.

Two typical Paracoccus exconjugants, R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7008-11 and R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7010-9, harboring the wild type and the codon optimized ATF1 gene from S. cerevisiae, respectively, were chosen for more detailed analysis and comparison to the control strain, R114/pBBR-K-mev-op-R114-PcrtE-crtE_(R114)+pMB7006-10. Strains were grown on F-medium as described above. Five hundred microliters of whole broth were harvested and lyophilized for 48 hours. Carotenoids were extracted and analyzed as described for Yarrowia lipolytica samples. FIG. 5 shows that ATF1-harboring strains produced about 10% mono-acetylated and about 4% di-acetylated zeaxanthin (as a percentage of total zeaxanthin. The ATF1-harboring strains also produced 74% and 65% more hydroxylated product (acetylated and free zeaxanthin, and βcryptoxanthin) than the control strain (55% and 43% more total zeaxanthin).

Example 4 Extraction and Quantification of Carotenoid Production by HPLC from Yarrowia Lipolytica and Paracoccus Cells

Shake flask testing and carotenoid analysis of generated strains were performed according to the methods described previously in U.S. Pat. No. 7,851,199 B2.

For quantification of acetylated carotenoids from Yarrowia and Paracoccus by HPLC and HPLC DAD MS the following methods were used:

Normal Phase Carotenoid Method

A Waters 1525 binary pump attached to a Waters 717 auto sampler was used to inject samples. A Phenomenex Luna 3 μ Silica (2), 150×4.6 mm column with a security silica guard column kit was used to resolve carotenoids. Synthetic carotenoid samples, purchased from CaroteNature (GmbH, Im Budler 8, CH-4419 Lupsingen, Switzerland) or received from DSM Nutritional Products Ltd., were used as reference standards. Acetylated compounds of astaxanthin and zeaxanthin were synthesized based on experiments reviewed in Kaewkoola and Krisnangkura, Chem Phys Lipids. 2010, 163: 685-688 and Kaewkool, et al., Eur. J. Lipid Sci. Technol. 2009, 111: 474-480. Approximately, 100 mg/L of carotenoid was dissolved in ethyl acetate and excess base of either sodium hydroxide or potassium hydroxide was added. Samples were allowed to sit in dark at room temp and analyzed periodically from 1-5 days. Synthesized acetylated components were then used as retention time markers, but quantitation is based on non-acetylated compound. All other acetylated compounds, except from zeaxanthin and astaxanthin, were identified by UV spectral features only. The mobile phase consisted of 1000 mL hexane, 30 mL isopropanol, and 0.1 mL acetic acid for astaxanthin-related compounds, or 1000 mL hexane, 60 mL isopropanol, and 0.1 mL acetic acid for zeaxanthin-related compounds. The flow rate for each run was 0.6 mL per minute. Column temperature was ambient. The injection volume was 20 μL. The detector was a photodiode array detector collecting from 210 to 600 nm.

A typical chromatogram for zeaxanthin related compounds using this method is shown in FIG. 6:

-   1: carotenes -   2: acetylated β-cryptoxanthin -   3: di-acetylated zeaxanthin -   4: β-cryptoxanthin -   5: mono-acetylated zeaxanthin -   6: zeaxanthin

A typical chromatogram for astaxanthin related compounds using the above method is shown in FIG. 7:

-   1: carotenes -   2: di-acetylated adonixanthin -   3: mono-acetylated adonixanthin -   4: di-acetylated astaxanthin -   5: adonirubin -   6: mono-acetylated astaxanthin -   7: astaxanthin

HPLC DAD MS method

For determination of acetylated zeaxanthin by HPLC DAD MS, samples were re-suspended in ice cold extraction solvent (50/50 v/v mix of hexane and ethyl acetate containing 0.01% butyl-hydroxy-toluene (BHT). An Alliance 2795 HPLC (Waters) system equipped with a Waters X-Bridge C18 column (3.5 μm, 2.1×50 mm) and a Thermo Basic 8 guard column (2.1×10 mm) was used to resolve carotenoids at 25° C. Authentic carotenoid samples were used as standards. The mobile phase and flow rates are shown below (Solvent A=Ethyl Acetate; Solvent B=Water; Solvent C=Methanol; Solvent D=Acetonitrile). The injection volume was 10 μL. The detector was a Waters 996 photodiode array detector in tandem with a Micro Mass Quattro Micro mass spectrometer. The mass spectrometer was run in default settings for single ion monitoring in positive ion mode. The cone voltage used was 35 V. The retention time for zeaxanthin was 1.09 minutes, maximum absorbance at 450 nm with mono isotopic mass 569.4 in positive ion mode. The retention time of the mono-acetylated zeaxanthin was 2.68 minutes, maximum absorbance at 450 nm with mono isotopic mass 611.4 in positive ion mode. The retention time of the di-acetylated zeaxanthin was 3.08 minutes, maximum absorbance at 450 nm with mono-isotopic mass 653.4 in positive ion mode. The retention time for other carotenoids was: β-cryptoxanthin, 3.2 minutes, lycopene, 3.6 minutes, y-carotene, 3.8 minutes and β-carotene, 3.95 minutes.

Flow Rate and Mobile Phase Gradient Time (min) Flow (mL/min) % A % B % C % D Curve 0.0 0.5 0 20 0 80 6 3.0 1.0 20 0 0 80 6 4.5 1.0 80 0 20 0 6 5.0 0.9 0 0 100 0 6 6.0 0.9 0 0 100 0 6 6.5 0.9 0 20 0 80 6 7.0 0.5 0 20 0 80 6 

1. A polypeptide having acetyl transferase activity, selected from the group consisting of: a. a polypeptide of SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 SEQ ID NO:8 or SEQ ID NO:14; b. a polypeptide derived from SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:14 by substitution, insertion or deletion of amino acids and having a homology of at least 50% at the amino acid level with the sequence of SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:14.
 2. A polypeptide of claim 1, wherein the protein has an enzymatic activity for converting a carotenoid molecule having (a) at least one β-ionone and/or at least one ε-ionone ring, and (b) at least one ring-associated hydroxyl group, into the corresponding, partially or fully acetylated carotenoid molecule.
 3. A polypeptide of claim 2, wherein the protein has an enzymatic activity for converting zeaxanthin, astaxanthin, lutein, and β-cryptoxanthin into zeaxanthin mono- or diacetate, astaxanthin mono- or diacetate, lutein mono- or di-acetate, and β-cryptoxanthin acetate, respectively.
 4. An isolated nucleic acid encoding the polypeptide of claim
 1. 5. The isolated nucleic acid of claim 4, which consists of the sequence depicted in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 or SEQ ID NO:13.
 6. A nucleic acid construct or expression vector comprising the polynucleotide of claim 4 or 5 operably linked to one or more (several) control sequences that direct the production of the polypeptide in an expression host cell.
 7. A transformed microorganism which is able to produce carotenoids, in which at least one heterologous polypeptide having acetyl transferase activity is expressed.
 8. A transformed microorganism according to claim 7, wherein the nucleic acid of claim 4 or 5 or the nucleic acid construct of claim 6 is expressed.
 9. The transformed microorganism of claim 7 or 8, for the production of carotenoids, whose carotenoid metabolism is different from that of a wild type.
 10. The transformed microorganism of any one of claims 7 to 9, wherein said microorganism is an oleaginous strain.
 11. The transformed microorganism of claim 10, wherein the said oleaginous strain is a strain of Yarrowia lipolytica.
 12. The transformed microorganism of any of the claims 7 to 9, wherein said microorganism is a bacterium.
 13. The transformed microorganism of claim 12, wherein said bacterial strain is a strain of Paracoccus zeaxanthinifaciens.
 14. A process for producing the transformed microorganism of any of claims 7 to 13, which comprises introducing a nucleic acid or nucleic acid construct consisting of the sequence depicted in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 or SEQ ID NO:13 which is functionally linked to one or more regulation signals.
 15. Use of a transformed microorganism of any of claims 7 to 13 for the production of carotenoids. 